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Claims 

1 . An image data compression method characterized in that, 

the image data are sectioned into blocks and orthogonal-transformed, at the time of 

quantizing this orthogonal transformed image data and compressing the image data, 

the center value for the dispersion of the above-mentioned image data for one screen is 

determined, 
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the encoded information quantity is estimated according to this center value for 
dispersion, 

by means of this encoded information quantity, the parameters are determined at the time 
of quantizing for encoding at the desired encoding information quantity. 

2. An image data compressing device characterized in that is equipped with; 

a block sectioning means that sections the input image data into blocks 

an orthogonal transform means that orthogonal transforms the above-mentioned image 
data sectioned into blocks by the block [sectioning] means, 

a quantizing means that quantizes the above-mentioned image data that have been 
orthogonal transformed by means of this orthogonal transform means, 

a center value determining means that determines the center value across one screen for 
the dispersion of the data within the block wherein the above-mentioned image data for one 
screen were made a block by the above-mentioned block sectioning means, 

an encoded information quantity estimating means that estimates the encoded information 
quantity according to this center value for dispersion, 

and a parameter determining means that determines the parameters that control the 
above-mentioned quantizing means according to this encoded information quantity. 

3. The image data compression method described in Claim 1 characterized in that 

the above-mentioned parameters are determined by means of 

Mathematical formula 1 

f(x)={(a+d)/(B a -ed)}-c 
(B a is the desired encoded information quantity, d is the quantity showing the properties of the 
image, c, d, and e are coefficients). 

Detailed explanation of the invention 
[0001] 

Industrial applications 

This invention relates to an image data compression method and its device that conducts 
compresses images data while recording the image data on an opticaVmagnetic disk or the like, 
or transmitting [the image data] using telephone lines. 

[0002] 

Prior technology 

In regard to still pictures within the methods that compress image data, there is the 
international standard of JPEG (Joint Photographic Experts Group). An abbreviated construction 
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diagram for a conventional image data compression device carries out compression for the JPEG 
baseline mode is shown in Figure 1 . The image data compression device shown in Figure 1 is 
equipped with an input terminal (1) to which the digital transform image signal is input, a block 
segmenter (2) that segments this image signal into blocks, a DCT transformer (3) that discrete 
cosine transforms (DCT: discrete cosine transform) [Term given in Japanese and English] each 
signal component of the divided blocks, a quantizing means (5) that rounds the data, quantizing 
by means of a quantizing table (4) [that quantizes] each transform signal component, an entropy 
encoder (6) that entropy encodes the quantized data, and an output terminal (7) from which the 
entropy encoded data are output. 

[0003] 

In this image data compression device, an image signal that is digital transformed is input 
to the input terminal (1), this image signal is sent to the block divider (2), and divided into blocks 
of 8 pixels x 8 pixels. Each signal component of the divided blocks is sent to the DCT 
transformer (3), and DCT, in other words, orthogonal transformed. As for each coefficient for 
each orthogonal transformed signal component, the quantized data are rounded using the 
quantizing table (4). The quantized data are sent to the entropy encoder (6), are entropy encoded 
by means of a Huffman table, and after being variable length encoded, are output to the output 
terminal (7), and recorded on an optical/magnetic disk or the like. Restoration as an image signal 
can be carried out by reversing the compression operation. 

[0004] 

The purpose of orthogonal transform such as DCT is to divide the image signal into 
low-frequency components and high-frequency components. Low frequency components 
comprise the major portion, but high-frequency components have a large meaning from the 
information standpoint, and is also greatly influenced by the compression rate, in other words, 
the encoded information quantity. The orthogonal-transformed image data are rounded after 
quantizing, but the compression rate for the data changes varies with how efficiently they are 
quantized. As for the quantizing table used in the quantizing operation, in the JPEG baseline 
mode, one in an 8 x 8 coefficient matrix (weighting coefficient), exemplified by JPEG, and 
multiplied by a constant k (scaling factor), is used as a quantizing step size for each coefficient. 
The greater the scaling factor that designates the picture quality, the more the information 
quantity is reduced after quantizing, and therefore, The higher the compression rate, but because 
the quantizing is coarse, the greater the degree of deterioration of the restored image. 
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In the JPEG standards, the scaling factor is not limited to a specific value, but determined 
by the user; in other words, by the method that uses that standard. In quantizing within JPEG 
standards, if v is taken as the horizontal spacing frequency, v as the vertical spacing frequency, 
each DCT coefficient as S v u, the quantizing step size as Qvu, the quantizing value (the 
quantizing level) as Sqvu, and the weighting coefficient as Wv u, and the scaling factor as k, it 
can be expressed as; 

Mathematical formula 2 

Sqvu = round (Svu+l/2Q V u (Svu > = 0) 

Mathematical formula 3 

Sqvu = round (S v u- l/2Q v u (Svu < 0) 
(Here, the term round (val) shows a integer value the closest to the actual val). 

Mathematical formula 4 
Qvu = Wvu X k 

In other words, the quantizing step size (spacing of the quantizing level) that is used in each DCT 
coefficient is generated from a coefficient matrix (weighting coefficient) in which different 
values are allocated in response to the visual sensitivity of a person in relation to the quantizing 



[00061 In the JPEG baseline mode, the Huffman encoding method is used as the method for the 
entropy encoding. 

At the time of quantizing, an error; namely, a distortion, is created for the purpose ot 
rounding the data. The function of the encoded information quantity and the distortion at the time 
of quantizing is known in information language as the "rate distortion function," and it becomes a 
curve like that of Figure 2. The encoded information quantity is always positive, and has the 
distinctive feature that it becomes a downwardly convex function that logarithmically decreases 
monotonically, to the distortion quantity. That which is equivalent to the distortion of Figure 2 is 
the scaling factor. 
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[0008] 

Problems the invention is to solve 

In actually compressing a still image and recording and saving it on an optical/magnetic 
disk or the like, a method such as mentioned above is used for compression, but the compression 
rate will very with how the scaling rate is determined. In other words, the compression rate for 
the image cannot be understood before compressing the image, and instances occur of the 
compression having to be redone at the time of saving. 

[0009] 

The present invention solves the above-mentioned unfavorable circumstances, and 
controls the encoded information quantity by automatically setting the scaling factor for the to 
obtain the desired encoded information quantity at the time of compressing the image data. Its 
purpose is to offer an image data compression device wherein the compression rate for the image 
can be predicted before the actual compression. 

[0010] 

Means for solving the problem and its operation 

The image data compression method of the present invention is characterized in that it 
segments into blocks and orthogonal transforms the image data, quantizes these orthogonal- 
transformed image data, and at the time of compressing the image data, determines the center 
value for the dispersion within the blocks of the above-mentioned image data for one screen, 
estimates the encoded information quantity according to this center value, and determines the 
scaling factor at the time of quantizing according to this estimated information quantity. 

[0011] 

Application examples 

An application example of the present invention is explained in detail by referencing the 
figures An outline construction diagram of an image data compression device of this invention 
that compresses for the JPEG baseline mode is shown in Figure 3. The image data compression 
device shown in Figure 3, together with being equipped with an input terminal (1 1) to which the 
image signal is input, a block segmenter (12) that segments this image signal into blocks, a DCT 
transformer (13) that discrete cosine transforms each signal component of the divided blocks, a 
quantizer (15) that rounds the data quantized by means of a quantizing table (14) [that quantizes] 
each transformed signal component, an entropy encoder (16) that entropy encodes the quantized 
data, and an output terminal (17) from which the entropy encoded data are output, is further 
equipped with a dispersion calculator (18) that finds the dispersion in each block in relation to 
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the image signal that has been divided into blocks, a center value determining device (19) that 
finds the center value after the dispersion is found, an encoded information estimating device 
(20) that estimates the encoded information quantity according to this center value, and a scaling 
factor determining means (21) that determines the scaling factor that creates the quantizing table 
(14) by using this encoded information quantity. 

[0012] 

The operation of the image data compression device of this example is explained. When a 
digital transform image signal is input to the input terminal (1 1), it is sent to the block segmenter 
(12) and segmented into blocks of 8 pixels x 8 pixels. The image signal that has been segmented 
into blocks is sent to the DCT transformer (13), discrete cosine transformed, and divided based 
on frequency components. The image signal that has been divided based on frequency 
components is sent to the quantizer (15), but immediately after this, the scaling factor is 
determined by the scaling factor determining device (21) without conducting the quantizing, and 
pauses for quantizing at the quantizer (15) until the quantizing table (14) is created. The 
dispersion calculator (18) finds the dispersion at each block for the image signal that has been 
divided into blocks. When the dispersion for all of the blocks for one screen is found, the center 
values are sent to the encoding information quantity estimating device (20), and the encoded 
information quantity is estimated by a method such as explained later. This encoded information 
quantity is sent to the scaling factor determining device (21), and the scaling factor is 
determined. The quantizing cable (14) is created based on this scaling factor. As for the 
above-mentioned image signal, the data are rounded using quantizing by means of the quantizing 
table (14). The quantized data are sent to the entropy encoder (16), are entropy encoded 
according to a Huffman table, and after being variable length encoded, are output to the output 
terminal (17), and recorded on an optical/magnetic disk or the like. 

[0013] 

As was explained above, in this example, since a difference in the scaling factor is not 
generated due to human operations of the device, user there is also no generation of the situation 
in which the setting of the scaling factor is redone, and the image data can be maintained at a 
fixed encoded information quantity without regard to differences in the input image signal. In 
other words, since the compression rate for the image can be predicted before compression, there 
is no necessity to redo the compression. 
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[0014] , 

A method of determining the scaling factor is explained. Figure 4 illustrates an example 
in which the encoded information quantity is actually measured for the majority of images by 
using a fixed scaling factor. In this example, the data are taken using one as the scaling factor, 
and plotted. The functions for the center value of the dispersion and the encoded information 
quantity can be approximated from Figure 4 as a parabola for a quadratic function and if the 
predicted coded quantity for the case wherein it is encoded with a quantizing table for a JPEG 
illustration is taken as B d , the center value of the dispersion within the block of the image as V m , 
and a and b are taken as the function for the scaling factor, then B d can be approximated as in 
Equation 1. 

Mathematical formula 5 

B d = (a-V m ) 1/2 +b (1) 
In Equation 1 (a-Vm) 1 ' 2 shows the center values of the standard deviation within the blocks of 
the image This approximation is carried out by the encoded information quantity estimating 
device (20) (Figure 3), and the predicted coded quantity (B d ) is determined using the curve of 
Figure 4 from the center value (B J that was found by the center value determining device (19). 

[0015] The fact is clear that the relationship between the encoded information quantity and the 
scaling factor becomes like the curve shown in Figure 2, but the calculation time becomes long if 
the scaling factor is found using this curve, and this is not practical. Figure 5 shows a curve that 
approximates the curve of Figure 2 using a hyperbolic function. As will be explained later, the 
scaling factor value (Q a ) is found from the encoded information quantity (B a ) found in this 
manner using the curve of Figure 5. Here, if the encoded information quantity for the designation 
is taken as (B a ), the scaling factor that is.found as (Q a ), the coefficients that designate the 
encoded information quantity as (a, c, d, and e), (B a ) can be approximated as in Equation 2. 

Mathematical formula 6 

B =( a +d)/(Q a +c)+ea ( 2 > 
To solve Equation 2 for the coefficient (a), it becomes Equation 3 . Now then, (Q d ) is the scaling 
factor value (= 1) that becomes the JPEG illustration quantizing table. 



Mathematical formula 7 

a={B d (Q d +c)-d}/{e(Q d +c)+l} (3) 
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The determination of the coefficient a by means of this approximation and predicted encoded 
quantity (B d ) are also conducted at the encoded information quantity estimating device (20), the 
encoded information quantity (B a ) is estimated, and the estimated encoded information quantity 
(B a ) and the coefficient (a) are sent to the scaling factor determining device (21). 

[0016] 

When the coefficient (a) is determined, the scaling factor (Q a ) is found using the curve 
shown in Figure 5 in the scaling factor determining device (21). The coefficient (a) is substituted 
in Equation 2, and if it is solved for the scaling factor (Q a ) and that is found, it becomes Equation 
4. 

Mathematical formula 8 

Q a = (a+d)/(B a -e a )-c t 4 l 
The quantizing table (14) is constructed using the scaling factor (Qa) that was found from the 
curve in Figure 5, and the quantizing of the above-mentioned image signal is conducted using the 
quantizer (15). 

[0017] , , 

The above-mentioned coefficients (a, b, c, d, and e) are coefficients for the purpose of 
approximating a simple correlation of experimental values for the predicted encoded quantity 
(B d ) and the encoded information quantity (B a ). As for the coefficient values, values are 
determined so that a correlation graph of the predicted encoded quantity (B d ) and the encoded 
information quantity (B a ) can be superimposed on the curves of these experimental value graphs. 
In this example, a = 1/2800, b = 1/10, c = 4, d = 8, and e = 1/50 are found. Fractions are shown 
because they are displayed as integer values for the purpose of making the calculations high 
speed. 

[0018] 

When the scaling factor is determined as was explained above, the calculations that 
required [a long] time in the past can be conducted in a short time, and therefore, at the time of 
constructing the device, it becomes possible to construct it simply and inexpensively. 

[0019] 

Effects of the invention 

As was explained above, according to the image data compression method and its device 
of the present invention, since there is no fear of generating differences in parameters due to 
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human operation by the user of the device, there is no generation of a worst-case scenario 
wherein the setting of the parameters is redone; thus image data can be saved at a fixed picture 
quality regardless of the recording medium. Since the compression rate for the image is predicted 
before compression, the necessity to redo the compression is eliminated. In the event of 
conducting calculations for a high standard relationships such as "rate-distortion relationship, 
because a large amount of time is required with a small type computer such as a personal 
computer, a medium size or larger computer becomes necessary, and at the time of constructing 
the device, the device becomes large scale, but if approximate equations are used in the image 
data compression method and its device according to this invention, the effect is to carry out 
calculations in a shortime that required [a long] time in the past, can be conducted m a short time, 
and at the time of constructing the device, the construction can simple and inexpensive. Also, 
because the image data compression method and its device of this invention the calculating speed 
can be accelerated using approximate equations, the image data compression method and its 
device of this invention can respond not just to still images, but also to quasi-dynamic images 
such as television conferences and the like. 

TWf descri ption of the invention 

Figure lis an outline construction diagram of an image data compression device used in 
the past for compression processing in JPEG baseline mode. 

Figure shows the relationship of the coding parameters and the encoded information 

^^Figure 3 is an outline construction diagram of an image data compression device of the 
present invention that conducts compression processing in JPEG baseline mode. 

Figure 4 is an example wherein the encoded information amount is measured for a large 
number of images using a fixed scaling factor. 

Figure 5 is a figure showing the curve for the case wherein a hyperbolic function is used 
to approximate the curve of Figure 2. 
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Figure 3 
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